Nucleic acid labeling by Thermoanaerobacter thermohydrosulfuricus DNA polymerase I variants

ABSTRACT

An enzymatically active DNA polymerase or active fragment thereof, having at least 80% identity in its amino acid sequence to the DNA polymerase of  Thermoanaerobacter thermohydrosulfuricus  or fragment thereof, and having an amino acid alteration at position 720 in Tts Pol I or at position 426 in ΔTts Pol I or at a homologous position defined with respect to Tts DNA ploymerase I, having improved incorporation of nucleotide analogs and natural bases during DNA synthesis compared to unaltered enzyme.

BACKGROUND OF THE INVENTION

[0001] 1. Field of the Invention

[0002]Thermoanaerobacter thermohydrosulfuricus (Tts) DNA polymerase has been cloned and expressed in E. coli and purified. A U.S patent (U.S. Pat. No. 5,744,312) has been recently issued to Amersham Life Science, Inc. A significant property of this polymerase is its ability to catalyze RNA-dependent DNA polymerase activity, reverse transcriptase activity (U.S. Pat. No. 5,744,312), in addition to its DNA dependent DNA polymerase. This polymerase performs optimally at a broad temperature range from 37-65 C. with maximal activity at 60 C. These activities combined with thermostability of the enzyme offer several benefits as discussed below. Several different variants of the enzyme have been generated for utility in DNA sequencing, for use in first-strand cDNA synthesis, RT-PCR and for strand displacement amplification.

[0003] 2. Description of Related Art

[0004] DNA polymerases and Reverse Transcriptases (RTs) isolated from various organisms ranging from bacteria, viruses, archaebacteria are being successfully used in the field of molecular biology for various applications. The growth temperatures for these organisms could range from extremely low to high. Applications of enzymes derived from the organisms range from cloning, polymerase chain reaction (PCR) (U.S. Pat. No. 4,683,195, Mullis et. al.), DNA sequencing, mutagenesis, genomic library construction, and nucleic acid labeling such as cDNA labeling for micro and macro arrays.

[0005] DNA Polymerases discriminate against the incorporation of unnatural bases during DNA synthesis. Most naturally occurring DNA polymerases also do not employ RNA as a template molecule. However, the natural template for a reverse transcriptase is both RNA and DNA. The natural building blocks for DNA polymerases and RTs are the four deoxy ribonucleotides (dATP, dGTP, dCTP and dTTP). Most naturally occurring polymerases and reverse transcriptases exhibit poor incorporation efficiencies towards most nucleotide analogs. The analogs could be any variants of naturally occurring dNTPs, such as ddNTPs, rNTPs, conjugates (dye or otherwise) of dNTPs and ddNTPs. This selection is important in the survival of the host. Frequent incorporation of non-natural bases would hamper subsequent rounds of replication resulting in the ultimate death of the organism. If and when polymerases do incorporate non-natural bases in their host, it is under extreme conditions that would lead to the ultimate survival of the organism. Such events however lead to mutations in the organism that may be needed for survival under extreme conditions. Therefore it is not unusual that native polymerases, having wild type amino acid sequence, either isolated directly from the host or by recombinant means exhibit a discriminatory effect towards non-natural nucleotides. Nevertheless, under very high concentrations of the analogs, native polymerases do incorporate these analogs during DNA synthesis albeit poorly. This feature is currently being exploited in all applications that use DNA polymerases or RTs for nucleic acid labeling. Consequently, the specific activity of the probe made using the naturally occurring polymerases or RTs is generally low. The current approaches to using natural enzymes for labeling encounter numerous technical difficulties. For example, incorporation of fluorescently labeled nucleotides by these naturally occurring enzymes can only be marginally improved by using excessive amounts of these labeled nucleotides in the reaction. But this imposes a different set of problems. It is generally difficult to remove the unused excess labeled nucleotides after the reaction, imposing serious problems with respect to poor signal to noise ratios. Additionally, a large amount of usually rare raw material is used to achieve marginal labeling. Apart from these problems, there is also sacrifice in the yield of the total probe generated. This is attributed to the discrimination by wild type polymerases and RTs to extend from an incorporated dNTP analog, such as a dye-dNTP. This again is a built-in feature of wild type polymerases and reverse transcriptases.

[0006] Higher specific activity probes are useful in multiple applications. This requires a facile addition of dye-dNMP followed by subsequent extension. Repeated rounds of addition of dye-dNMP and extension results in the formation of probes with higher specific activity. Since, naturally occurring polymerases and RTs are discriminatory to both addition and extension of a dNTP analog or dye-dNTP, the probes generated are of low specific activity.

[0007] As the above discussion suggests, a way of altering the natural properties of polymerases for better incorporation of nucleotide analogs during DNA synthesis, is desirable. For example, an improved ability to incorporate labeled nucleotides in various nucleic acid applications such as rolling circle amplification and single nucleotide polymorphism detection would be useful. This concern is addressed in greater detail below.

SUMMARY OF THE INVENTION

[0008] Accordingly, it is the object of the invention to provide an enzymatically active DNA polymerase having improved incorporation of nucleotide analogs and natural bases during DNA synthesis and a method of incorporating dye labeled dNTP's using the DNA polymerase or an active fragment thereof. It is a further object of the invention to provide a method of utilizing the DNA polymerase for performing direct RNA sequencing and to provide kits for labeling a polynucleotide from a DNA or RNA template with a DNA or RNA primer comprising the DNA polymerase.

[0009] The objectives are met by the present invention, which relates in one aspect to a DNA polymerase or active fragment thereof. The DNA polymerse or active fragment thereof, has at least 80% identity in its amino acid sequence to the DNA polymerase of Thermoanaerobacter thermohydrosulfuricus or a fragment thereof, and has an amino acid alteration at position 720 in Tts Pol I or at position 426 in ΔTts or at a homologous position defined with respect to Tts DNA polymerase, and has improved incorporation of nucleotide anaologs and natural bases during DNA synthesis, as compared to unaltered enzyme. In one embodiment the nucleotide analogs are dNTP, ddNTP and rNTP analogs. In a second embodiment the dNTP, ddNTP and rNTP analogs are dye-conjugated or biotin-conjugated. In a third embodiment the dye in the dye-conjugated nucleotide analogs is a rhodamine or Cyanine derivative dye. In a fourth embodiment the rhodamine dye is R110, R6G, TMR or Rox. In a fifth embodiment the Cyanine derivative dye is Cy3, Cy3.5, Cy5.0 or Cy5.5. In a sixth embodiment the DNA polymerase has the asparatate at position 720 in Tts Pol I or at position 426 in ΔTts Pol I, replaced with agrinine.

[0010] A related aspect of the invention relates to a method of utilizing the DNA polymerase of Thermoanaerobacter thermohydrosulfuricus or a fragment thereof, having an amino acid alteration at position 720 in Tts Pol I or at position 426 in ΔTts or at a homologous position defined with respect to Tts DNA polymerase, having improved incorporation of nucleotide analogs and natural bases during DNA synthesis, as compared to unaltered enzyme, for incorporating Cy3 and Cy5 dye conjugated dNTP's across a range of reaction temperatures form 37-65° C.

[0011] In a further aspect, the invention relates to a method of utilizing the DNA polymerase for performing direct RNA sequencing, while a further aspect relates to providing kits for labeling a polynucleotide from a DNA or RNA template with a DNA or RNA primer comprising the DNA polymerase.

[0012] The above objects and features of the invention will become more fully apparent when the following detailed description of the invention is read in conjunction with the accompanying figures.

BRIEF DESCRIPTION OF DRAWINGS

[0013]FIG. 1. (SEQ ID No. 1) is the Amino acid sequence of the full-length of Tts DNA polymerase I. A full-length recombinant form of the enzyme, harboring both the native 5′-3′ DNA template mediated DNA polymerase function and 5′-3′ exonuclease. (Covered under U.S. Pat. No. 5,744,312) serves as a reference amino acid sequence. In addition, the enzyme harbors reverse transcriptase activity.

[0014]FIG. 1A. (SEQ ID No. 2) is the DNA sequence of the full-length of Tts DNA polymerase I (Covered under U.S. Pat. No. 5,744,312)

[0015]FIG. 2. (SEQ ID No. 3) is the amino acid sequence of the ΔTts DNA polymerase I. A 5′-3′ exonuclease deficient (exo−) form of the enzyme, with a truncation at the amino-terminus. (Covered under U.S. Pat. No. 5,744,312). Blocked portion represents the region of the deleted amino acids from the full-length version of the enzyme.

[0016]FIG. 2A. DNA sequence of the ΔTts DNA polymerase I. (SEQ ID No. 4). (Covered under U.S. Pat. No. 5,744,312)

[0017]FIG. 3. Amino acid sequence of the F412Y variant of the ΔTts DNA polymerase I. (Covered under U.S. Pat. No. 5,744,312). Blocked portion represents the region of the deleted amino acids from the full-length version of the enzyme (Position 412 in ΔTts corresponds to 706 in full-length enzyme, phenylalanine in this position is implicated in discrimination towards ddNTP. The F412Y change facilitates easy incorporation of ddNTP. (SEQ ID No. 5)

[0018]FIG. 3A. DNA sequence of the F412Y variant of the ΔTts DNA polymerase I. (SEQ ID No. 6). (Covered under U.S. Pat. No. 5,744,312)

[0019]FIG. 4. Amino acid sequence of the ΔTtsF412YD426R variant polymerase. Blocked portion is the deleted amino acids from the full-length version of the enzyme. Position 412 in ΔTts corresponds to 706 in full-length enzyme, phenylalanine in this position is implicated in discrimination towards ddNTP. Position 426 in ΔTts corresponds to 720 in full-length enzyme. Discrimination towards both incorporation and extension of a dye conjugates of dNTP, rNTP or ddNTP is governed by aspartate residue at this position. Mutation was generated by oligonucleotide based site-directed mutagenesis technique to introduce a Δ Tts D426R change in the Tts F412Y background. (SEQ ID No. 7)

[0020]FIG. 5. Amino acid sequence of the ΔTts D426R polymerase. Blocked portion is the deleted amino acids from the full-length version of the enzyme. Position 426 in ΔTts corresponds to 720 in full-length enzyme. Discrimination towards both incorporation and extension of a dye conjugates of dNTP, rNTP or ddNTP is governed by aspartate residue at this position. (SEQ ID No. 8)

[0021]FIG. 6. Alignment of wild type Pol I sequences from different microorganisms. Homologous positions around the “finger region” of Polymerases of Pol I family are shown here. Richardson, “DNA polymerase from Escherichia coli,” Procedures in Nucleic Acid Research, Cantoni and Davies editors, Harper and Row, New York, pp. 263-276 (1966). Scopes, Protein Purification, Springer-Verlag, New York, N.Y., pp. 46-48 (1994).

[0022]FIG. 7. Improved incorporation of Dye (Cy 3.5)-dCTP and dCTP by ΔTts D426R form of ΔTts Pol I.

[0023]FIG. 8. Direct RNA sequencing by AMV RT, ΔTts and ΔTts F412Y Pol I.

[0024]FIG. 9. ΔTts F412YD426R performance in cDNA labeling using Cy3 and Cy5-dCTP and utility in microarray applications.

[0025]FIG. 10. ΔTts F412YD426R performance in cDNA labeling using Cy3 and Cy5-dUTP and utility in microarray applications.

[0026]FIG. 11. Usefulness of ΔTts F412Y or ΔTtsF412YD426R Pol I in single nucleotide primer extension (SnuPE), using RNA templates. Incorporation of dye labeled or unlabeled ddA, ddT, ddG and ddC is demonstrated here.

[0027]FIG. 12. Utility of ΔTts DNA polymerase in Rolling Circle Amplification reaction

[0028]FIG. 13. Incorporation of dye labeled nucleotide during DNA dependent DNA synthesis ΔTts, ΔTts F412Y, ΔTts F412YD426R.

[0029]FIGS. 14a & b. ΔTtsF412YD426R Performance in cDNA labeling using Cy3/Cy5-dCTP; demonstration of accurate determination of gene expression over a wide reaction temperature range (FIGS. 14a and b).

DETAILED DESCRIPTION OF THE INVENTION

[0030] The present invention discloses the utility of native DNA pol I and variant forms of DNA Pol I of Thermoanaerobacter thermohydrosulfuricus for nucleic acid labeling by fluorescent nucleotide analogs. Utility in applications such as cDNA labeling, rolling circle amplification, RNA sequencing and single nucleotide primer extension on RNA is also covered.

[0031] In this present invention we have found ways of altering the natural properties of polymerases for better incorporation of nucleotide analogs during DNA synthesis. Described here are modifications that can be introduced to the naturally occurring polymerases/reverse transcriptases to facilitate incorporation of fluorescent labeled nucleotides. The present invention identifies a single amino acid residue in polymerases that is responsible for improved incorporation of certain nucleotide analogs. A change in amino acid residue results in a profound increase in the ability of the enzyme for incorporation and extension from dye labeled nucleotides. This feature is useful in any nucleic acid application that employs fluorescent labeling by incorporation of nucleotide analog by a DNA polymerase. Such applications include labeling during DNA synthesis in various applications such as microarray analysis of gene expression. We also show the utility of some variants of the enzyme in direct RNA sequencing, rolling circle amplification, single nucleotide polymorphism detection (SNP) by single nucleotide primer extension utilizing either DNA or RNA templates. This invention relates to the wild type and mutant forms of the enzymes and their DNA sequence and amino acid sequence and the vectors that are used to generate them.

[0032] The first aspect of the invention relates to the generation and purification of a variant form of the native DNA Pol I of Thermoanaerobacter thermohydrosulfuricus and of sequences of polymerases that are at least 80% amino acid sequence identity as shown in FIG. 1 (U.S. Pat. No. 5,744,312).

[0033]FIG. 1 shows the reference sequence of the amino acid encoded by the genomic DNA between positions 1056-3674 of the Tts revealed in patent (U.S. Pat. No. 5,744,312) (SEQ ID No. 1). Enzymes have been engineered in the previous disclosure to abolish an associated 5′-3′ exonuclease function in the native enzyme and is shown here as reference sequence in FIG. 2.

[0034] For ease of reference FIGS. 1, 2 and 3 are illustrated here (covered under U.S. Pat. No. 5,744,312). FIG. 1 is a full-length recombinant form of the enzyme, harboring both the native 5′-3′ DNA template mediated DNA polymerase function and 5′-3′ exonuclease. (covered under U.S. Pat. No. 5,744,312) serves as a reference amino acid sequence as shown in FIG. 1. The full-length version of the enzyme henceforth in this document will be referred to as Tts DNA Pol I.

[0035]FIG. 2 is a 5′-3′ exonuclease deficient (exo−) form of the enzyme, with a truncation at the amino-terminus. (covered under U.S. Pat. No. 5,744,312). Henceforth this form of the enzyme will be referred to as ΔTts enzyme. The numbering of amino acids for truncated form of the enzyme begins with the first amino acid of the truncated form. Additionally in some instances numbering of amino acids in this document is also indicated on non-truncated full-length version of the enzyme for easy comparison.

[0036]FIG. 3 is an exonulease deficient truncated version form of the enzyme with an F (phenlyalanine) to Y (Tyrosine) change in the O-helix region at position 412, and is shown for reference (covered under U.S. Pat. No. 5,744,312)

[0037]FIG. 4 is the enzyme showing the introduction of a point mutation altering the Aspartate (D) residue at 426 to Arginine (R) in ΔTtsF412Y form of the enzyme. This form of the enzyme henceforth will be referred to as ΔTtsF412YD420R.

[0038]FIG. 5 shows another form of enzyme referred to as ΔTtsD426R by reversing the tyrosine (Y) residue at 412 back to phenylalanine (F) of ΔTtsF412Y form of the enzyme. Single letter amino acids are according to conventional codes used in the literature.

[0039] U.S. Pat. No. 5,744,312 shows utility for the native Tts, ΔTts, ΔTtsF412Y in applications ranging from cDNA preparation, strand displacement amplification (Walker et al., “Isothermal in vitro amplification of DNA by a restriction enzyme/DNA polymerase system,” Proc. Natl. Acad. Sci. USA 89:392-396 (1992)) and DNA sequencing.

[0040] The present application shows the utility of various forms of Tts enzyme in the incorporation of non-natural base analogs during DNA synthesis. Some examples are the incorporation of either unlabeled or dye-labeled versions of dNTPs, ddNTPs and rNTPs. DNA synthesis can be either DNA template mediated (DNA polymerase activity) or RNA template mediated (reverse transcriptase activity). DNA or RNA template-mediated cDNA probes are increasingly in demand for microarray applications. This invention demonstrates the utility of the enzyme variants in nucleic acid labeling during DNA synthesis with particular emphasis on microarray applications for gene expression studies.

[0041] Incorporation of ddNTP or ddNTP analogs is extremely useful in applications such as DNA or RNA sequencing. Herein it is demonstrated that ΔTtsF412Y and ΔTtsF412YD426R forms of the enzyme holds great promise for applications such as direct RNA sequencing and situations where single nucleotide primer extension is monitored. For example it can be seen that the F412Y variant is capable of generating excellent sequence information from short stretches of RNA compared to retroviral reverse transcriptases. Experimental results are also presented to document the utility of some enzyme variants in single nucleotide primer extension (SnuPE) applications for interrogation of target sequence of DNA or RNA backbone.

[0042] This finding is useful in applications that involve single polymorphism detection, mutation detection in DNA or RNA. Direct mutation detection at RNA level is useful in many respects. An example of such an application would be to determine drug resistance mutations in Human Immunodeficiency viral (HIV) RNA from patients undergoing drug treatments. Resistance mutations to HIV reverse transcriptase and protease inhibitors are attributed directly to mutations in the genes encoding these proteins in the RNA genome. Additionally, in humans and higher organisms improper splicing of RNA leading to defective mRNA is implicated in major disorders. Direct RNA sequencing of limited stretch such RNA or direct detection of improperly spliced RNA by mutation detection using SnuPE is feasible with the ΔTtsF412Y or ΔTtsF412RD426R variants. These would be different from current approaches that are being followed. Since retroviral RTs are not good sequencing enzymes, in current approaches a RT-PCR step is required before sequencing is undertaken.

[0043] In addition to mutation detection, Tts Pol I variants can be used for estimating RNA copy number. This has value in HIV research or gene expression studies. The enzyme's ability to incorporate dye-terminators and its potential for incorporating dye-labeled dNTP and ddNTP during cDNA synthesis can be capitalized on for estimating copy number of HIV-RNA, hence for estimating virus titer. This property of dye-labeled nucleotide incorporation by ΔTtsD426R would also be useful in mRNA quantification and gene expression studies on micro or macroarrays. The alternative strategies that are currently being employed: 1) quantitative RT-PCR used for estimating viral RNA and mRNA. 2) Branched DNA/nuclease excision for viral and mRNA quantitation.

[0044] The utility of Tts enzyme variants in strand displacement amplifications such as Rolling Circle Amplification (RCA) is also demonstrated in this invention.

EXAMPLES

[0045] The following examples serve to illustrate the utility of the subject DNA polymerases and are for illustration purposes only and should not be used in any way to limit the appended claims.

Example 1

[0046] Generation of ΔTts F412YD426R Variant (FIG. 4)

[0047] The expression vector PLS-3 harboring ΔTts F412Y variant disclosed in U.S. Pat. No. 5,744,312 served as a starting plasmid for this invention. Primers were designed to alter the codon encoding the residue 426 of ΔTts pol I from asparate to arginine. A forward primer of sequence “gggctttctcgacgccttaaaatatca” (SEQ ID No. 9)(encoding positions 422 to 430) and a complementary sequence was employed to introduce the intended point mutation. The new codon used for amino acid R was “cgc”. The primers were annealed and a cycling reaction with Pfu DNA polymerase was carried out in the presence of all four dNTPs to generate new strands. The final product was used in transformation of E. coli strain and colonies were screened individually by DNA sequencing to select for clones with desired mutation.

Example 2

[0048] Generation of ΔTts D426R Variant (FIG. 5)

[0049] A clone containing the plasmid with ΔTts F412YD426R variant shown in example 1 above served as a starting material for the generation of ΔTts D426R variant. A strategy similar to above was employed. Two primers complementary to each other were designed to introduce the intended original phenylalanine “F” residue at position 412 of ΔTts F412YD426R. A forward primer GCCGTAAATTTTGGCATAATATATGGC (SEQ ID No. 10)(to span positions 409 to 417 of the ΔTts F412YD426R polymerase) and a complementary sequence was employed to change “Y” residue were designed. A codon “TTT” for phenylalanine was employed to engineer the change.

Example 3

[0050] Alignment of Wild Type Pol I Sequences from Different Microorganisms FIG. 6.

[0051] Homologous positions in finger region of Polymerases of Pol I family are shown here. Note the alignment of amino acids corresponding to 720 of full-length (position 426 of ΔTts) Tts Pol I. The blocked region demonstrates the region of homology between the enzymes. The role of phenlyalanine at positions corresponding 706 of Tts Pol I in discrimination towards ddNTPs is shown for reference and has been documented in literature. The claims of this patent cover the role of amino acid at position 720 of full-length Tts DNA polymerase I. Alteration of this amino acid results in easy incorporation of dye-labeled nucleotide analogs. A negatively charged amino acid at this position is more discriminatory towards the incorporation of dye-labeled nucleotide. An alteration to positively charged residues such as arginine or lysine or other bulky residues results in the lowering of discrimination towards the dNTP or ddNTP conjugates. Besides the usefulness of other naturally occurring polymerases for dye nucleotide labeling, that naturally harbor residues other than glutamate or aspartate are also covered in this patent.

Example 4

[0052] Assay Conditions

[0053] The experiment shown in FIG. 7 investigates the relative efficiencies of incorporation of dye-CTP (Cy3.5-dCTP) and dCTP. Three enzyme preparations ΔTts, ΔTts F412Y, ΔTtsF412YD426R were analyzed in this experiment.

[0054] Optimized 1×buffer compositions for Reverse Transcriptase (RNA dependent DNA Polymerase) and DNA Polymerase (DNA dependent DNA polymerase) reactions for all variants of Tts Pol I are as follows. Tris, pH 8.0 (50 mM), KCl (40 mM), MgCl2 (3 mM), DTT (1 mM), DNA or RNA template (as needed), primer (5 to 50 femto mols), enzyme 0.5 to 1 units, dNTP or dNTP analog (varying concentrations as needed). In standard synthesis reactions, when full-length synthesis is monitored, 50 uM of all 4 dNTPs are included. Typical reaction volume is 10 ul. Reaction temperature was kept between 37-60 C. depending on the experimental needs. Reaction time was limited to 10 minutes for single nucleotide incorporation studies. Time was varied as needed for the purpose of the experiments, sometimes up to 1 hour if longer extensions are monitored.

[0055] The experiment shown in FIG. 7 investigates the relative efficiencies of incorporation of dye-CTP (Cy3.5-dCTP) and dCTP. Three enzyme preparations ΔTts, ΔTtsFY, ΔTtsF412YD426R were analyzed in this experiment.

[0056] In FIG. 7, Globin mRNA served as the template. A 5′ P-33 labeled primer (DNA 25-mer) was annealed to the template. Reactions were performed with varying concentrations of either Cy3.5-dCTP or dCTP alone. Inclusion of only one dNTP allowed incorporation of the next correct nucleotide alone. The sequence of the template-primer that allowed for the examination of single nucleotide “C” incorporation is shown below. CAGGCTGCCTATCAGAAGGTGGTGGCTGGTGTGGCCkATGCCCTGGCTCA 5′ 3′ mRNA (SEQ ID No. 11) TTCCACCACCGACCACACCGGTTACGG* CP-16 3′-5′ (SEQ ID No. 12)

[0057] Lanes 1,2,3 and 4 contain 20, 2, 0.2, and 0.02 uM of dNTP or dye-dNTP. Lane with no label has no enzyme, to show the integrity of the starting P-33 labeled primer. Quantification of the single nucleotide extension product is one way to tell if DR change to in the enzyme led to any consequence. “P” indicates a radio labeled primer. “P+1” represents the elongated product by a single nucleotide or nucleotide analog. P+1 migrates slowly with the dye-dNTP conjugate. Note that the migration of Cy3.5 dCMP containing bands travel slowly on the gel compared to dCTP extended products. Comparing Panel A, B and C, lanes 1 through 4 shows that the alteration of the amino acid back bone of the enzyme from D to R results in the improved efficiency of natural nucleotides. P+1 product is achieved between 10-100 fold less concentration of dCTP with the enzyme having the DR change Likewise, a comparison of panels A′, B′ and C′ reveals that the alteration DR results in improved incorporation of dye-CTP. Essentially, incorporation is achieved at lower concentrations (less than 10 times) of dye-dNTP compared to that of the wild type polymerase. It is evident that the DR enzyme was able to incorporate Cy3.5 dCTP at concentrations as low as 0.2 uM (Cy3.5 dCTP) or even lower. Compare this with the D enzyme, which exhibits relatively poor incorporation at these lower concentrations. And the results also show that this mutation dramatically reverses the decreased incorporation of dye-CTP seen with Tts F412Y in panel B′. In this experiment the template-primer is a mRNA annealed to radio labeled primer. Extension is monitored qualitatively as P+1 for the natural nucleotides and P+1* for dye labeled nucleotide. This is a promising first observation for potential use in microarray applications for cDNA probe labeling.

Example 5

[0058] Direct RNA Sequencing by AMV RT, ΔTts and ΔTtsF412Y Pol I (FIG. 8).

[0059] Globin mRNA served as a template. The 50-mer DNA was used a primer. Standard sequencing components in Amersham Pharmacia Thermo Sequenase kit were employed for sequencing. P-33 labeled terminators (ddNTP) were obtained from Amersham Pharmacia Biotech. Post-sequencing reaction products were separated on 6% urea-polyacrylamide gels. AMV, Avian Myeloblastosis virus RT.

Example 6

[0060] ΔTtsF412YD426R Performance in cDNA Labeling Using Cy3 and Cy5-dCTP and Utility in Microarray Applications (FIG. 9).

[0061] A typical 20 μl reaction Cy3 or Cy5 reaction had 1 μg of human skeletal muscle mRNA, oligo dT₍₂₅₎ and random nonamer primers and TtsFYDR polymerase enzyme in 1×reaction buffer (50 mM Tris, pH 8.0, 1 mM DDT, 40 mM KC, 100 uM dA,G and TTP and 50 um each of CTP and Cy3-dCTP or Cy5-dCTP depending on the reaction). Control mRNAs (APBiotech Inc.) of known sequence compositions were included in various concentrations to serve as dynamic range and gene expression ratio controls. Tts reactions were carried out at temperatures from 50 degrees C. Template RNA was hydrolyzed by alkali treatment and neutralized with HEPES.

[0062] Probes were purified using MultiScreen filters (Millipore) and quantified by spectrophotometry. Glass slides containing human cDNA gene targets were hybridized with equal amounts (30 pmol each) of Cy3 and Cy5 labeled cDNA probes. Slides were scanned using a GenePix® (Axon) scanner and quantified using ImageQuant software. The figure illustrates the accurate representation of probes, near even incorporation of Cy3 and Cy5 and differential gene expression in Cy3 versus Cy5 reactions.

Example 7

[0063] ΔTtsF412YD426R Performance in cDNA Labeling Using Cy3 and Cy5-dUTP and Utility in Microarray Applications (FIG. 10).

[0064] A typical 20 μl reaction Cy3 or Cy5 reaction had 1 μg of human skeletal muscle mRNA, oligo dT₍₂₅₎ and random nonamer primers and TtsFYDR polymerase enzyme in 1×reaction buffer (50 mM Tris, pH 8.0, 1 mM DDT, 40 mM KC, 100 uM dA,G and CTP and 50 um each of TTP and Cy3-dUTP or Cy5-dUTP depending on the reaction). Control mRNAs (APBiotech Inc.) of known sequence compositions were included in various concentrations to serve as dynamic range and gene expression ratio controls. Tts reactions were carried out at temperatures from 50 degrees C. Template RNA was hydrolyzed by alkali treatment and neutralized with HEPES.

[0065] Probes were purified using MultiScreen filters (Millipore) and quantified by spectrophotometry. Glass slides containing human cDNA gene targets were hybridized with equal amounts (30 pmol each) of Cy3 and Cy5 labeled cDNA probes. Slides were scanned using a GenePix® (Axon) scanner and quantified using ImageQuant software. The figure illustrates the accurate representation of probes, near even incorporation of Cy3 and Cy5 and differential gene expression in Cy3 versus Cy5 reactions.

Example 8

[0066] Usefulness of ΔTtsF412Y or ΔTtsF412YD426R Pol I in SnuPE, single nucleotide primer extension for investigation of target base on RNA (FIG. 11). Lane 1 is dNTP (G, A, T or C in panels A, B, C and D). Lane 2 is cold ddNTP. Lane 3 is a dye labeled ddNTP (linker arm length eleven carbon atoms). Lane 4 is a dye labeled ddNTP (linker arm length four carbon atoms). The dyes are from rhodamine class of FAM, R6G, TMR and Rox conjugated to ddG, ddA, ddT and ddC by either a 4-carbon or 11-carbon linkage.

[0067] The sequence of the template-primer that allowed for the examination of single nucleotide “G” incorporation is shown below. CAGGCTGCCTATCAGAAGGTGGTGGCTGGTGTGGCCAATGCCCTGGCTCA 5′ 3′ mRNA (SEQ ID No. 13) TCTTCCACCACCGACCACACCGGTTACGG* CP-18 (3′-5′) (SEQ ID No. 14)

[0068] The sequence of the template-primer that allowed for the examination of single nucleotide “A” incorporation is shown below. CAGGCTGCCTATCAGAAGGTGGTGGCTGGTGTGGCCAATGCCCTGGCTCA 5′ 3′ mRNA (SEQ ID No. 15) GTCTTCCACCACCGACCACACCGGTTACGG* CP-19 (3′-5′) (SEQ ID No. 16)

[0069] The sequence of the template-primer that allowed for the examination of single nucleotide “T” incorporation is shown below. CAGGCTGCCTATCAGAAGGTGGTGGCTGGTGTGGCCAATGCCCTGGCTCA 5′ 3′ mRNA (SEQ ID No. 17) CTTCCACCACCGACCACACCGGTTACGG* CP-17 (3′-5′) (SEQ ID No. 18)

[0070] The sequence of the template-primer that allowed for the examination of single nucleotide “C” incorporation is shown below. CAGGCTGCCTATCAGAAGGTGGTGGCTGGTGTGGCCAATGCCCTGGCTCA 5′ 3′ mRNA (SEQ TD No. 19) TTCCACCACCGACCACACCGGTTACGG* CP-16 (3′-5′) (SEQ ID No. 20)

[0071] Assays for measuring efficiency of dye-ddNTP or ddNTP by Tts variants were measured as below. A cocktail containing all reaction components except the ddNTP or dye-ddNTP was prepared as below. The reactions contained the following components. Tris, pH 8.0 (50 mM), KCl (40 mM), MgCl₂, (3 mM) DTT (1 mM) dNTP or dye dNTP 0.2 uM (lanes 1, 2, 3 and 4), 5′ labeled p-33 primer (0.2 pimol), mRNA globin Template (100 ng), enzyme in a 10-ul reaction volume.

[0072] Template-primer annealing was accomplished by treating the components at 60 C. for 10 minutes followed by slowly cooling to 37 C. to allow for proper annealing. Reactions carried out for 10 min at appropriate temperature. Reactions were terminated by addition of 6 ul of formamide-stop solution. Samples were separated and analyzed on a 16% denaturing polyacrylamide gel. The wet gel was dried on Whatmann Filter paper and imaged using Autoradiography or PhosPhor Imager.

Example 9

[0073] Utility of Either ΔTts in RCA Reaction (FIG. 12)

[0074] Isothermal Rolling circle amplification reactions were performed as below. Template circular DNA was with primers 1 (Complementary to the circle) and 2 (same polarity as the circle), with all the components including the enzyme were combined as below. The reactions were performed at 55 C. for an hour and products analyzed following separation on 1% agarose gel. A 20-ul reaction contained, Tris pH 8.0 (50 uM), KCl (40 uM), MgCl2 (3 uM), DTT (1 uM) and dNTP (400 uM), primer 1 & 2 (1 uM, each), template and enzyme 20 units. Tth pol I reactions were done at 70 C. and Bst DNA pol reactions carried out at 55 C.

Example 10

[0075] Incorporation of Dye Labeled Nucleotide During DNA Dependent DNA Synthesis ΔTts, ΔTtsF412Y, ΔTtsF412YD426R (FIG. 13).

[0076] Primer extension was monitored using a defined DNA template-DNA primer.

[0077] In this experiment the relative efficiencies of incorporation of dye-CTP (Cy3.5-dCTP) was investigated. Three enzyme preparations were tested ΔTts, ΔTtsF412Y, and ΔTtsF412YD426R

[0078] Defined DNA shown below served as the template. A P-33 labeled primer (DNA 25 mer) was annealed to the template. Reactions were performed with varying concentrations of either Cy3.5-dCTP or dCTP alone. Inclusion of only one dNTP allowed incorporation of the next correct nucleotide alone. The sequence of the template-primer that allowed for the examination of single nucleotide “C” incorporation is shown below. CAGGCTGCCTATCAGAAGGTGGTGGCTGGTGTGGCCAATGCCCTGGCTCA 5′ 3′ DNA (SEQ ID No. 21) TTCCACCACCGACCACACCGGTTACGG* CP-16 3′-5′ (SEQ hID No. 22)

[0079] Lanes 1, 2, 3, 4 and 5 contain 20, 2, 0.2, 0.02 and 0 uM dye-dCTP. Lane 5 has no enzyme, to show the integrity of the starting p-33 labeled primer. Quantification of the single nucleotide extension product is one way to tell if DR change in the enzyme led to any consequence. “P” indicates a radio labeled primer. “P+1” represents the elongated product by a single nucleotide or nucleotide analog. P+1 migrates slowly with the dye-dNTP conjugate. Note that the migration of Cy3.5-dCTP containing band travel slowly on the gel compared to dCMP extended products. Comparing Panel A, B and C, lanes 1 through 4 shows that the alteration of the amino acid back bone of the enzyme from D to R results in the improved efficiency of natural nucleotides. P+1 product is achieved between 10-100 fold less concentration of dye-dCTP with the enzyme having the DR change. Essentially, incorporation is achieved at a much lower concentration of dye-dNMP compared to the wild type enzyme. It is evident that the DR enzyme was able to incorporate Cy3.5-dCTP at concentrations as low as 0.2 uM or even lower. Compare this with the D enzyme (wild type), which exhibits relatively poor incorporation at these lower concentrations. And the results also show that this mutation dramatically reverses the decreased incorporation of dye-CMP seen with Tts F412Y in panel B. These observations demonstrate the utility of a DR variant in labeling during DNA template dependent synthesis as well.

Example 11

[0080] ΔTtsF412YD426R Performance in cDNA Labeling Using Cy3/Cy5-dCTP; Demonstration of Accurate Determination of Gene Expression Over a Wide Reaction Temperature Range (FIGS. 14a and b).

[0081] A 20 μl reaction Cy3 or Cy5 reaction had 1 μg of human skeletal muscle mRNA, oligo dT₍₂₅₎ and random nonamer primers and TtsFYDR polymerase enzyme in 1×reaction buffer (50 mM Tris, pH 8.0, 1 mM DDT, 40 mM KC, 100 uM dA,G and TTP and 50 um each of CTP and Cy3-dCTP or Cy5-dCTP depending on the reaction). Control mRNAs (APBiotech Inc.) of known sequence compositions were included in various concentrations to serve as dynamic range and gene expression ratio controls. Tts reactions were carried out at temperatures from 37, 42, 45, 50, 55, 60 and 65 degrees. For Superscript II, cDNA synthesis reactions were carried out at 42 C. (Life Technologies). Template RNA was hydrolyzed by alkali treatment and neutralized with HEPES.

[0082] Probes were purified using MultiScreen filters (Millipore) and quantified by spectrophotometry. Glass slides containing human cDNA gene targets were hybridized with equal amounts (30 pmol each) of Cy3 and Cy5 labeled cDNA probes. Slides were scanned using a GenePix® (Axon) scanner and quantified using ImageQuant software. A normalization factor of 2 (due to differences in the excitation efficiencies of Cy3 and Cy5) was applied to the observed ratio of raw flourescence signal. The figure illustrates precise determination of gene expression differences in Cy3 and Cy5 reactions. For example across all temperature ranges the normalized observed ratios were very close to the target ratios demonstrating the ability to accurately determine gene expression differences over a wide temperature range using ΔTtsF412YD426R.

Example 12

[0083] Protein Purification Protocol ΔTtsF412YD426R or ΔTtsD426R pol I Purification Scheme

[0084] The following was the protocol adapted for cells harvested from 1 L of LB media for initial enzyme evaluation studies (Typical yield of wet cells 4-6 g). E. coli cells harboring the expression vector were grown according to standard protocols as described in the original patent and harvested and kept frozen until ready for use. Cell lysis was carried out by adding 5 ml lysis buffer for every gram of wet cell paste (50 mM Tris pH 8.0, 1 mM EDTA, 50 mM NaCl, 10% Glycerol and containing 1 mg/ml lysozyme). Cells were left on ice for 40 minutes. Upon complete resuspension the cells were passed through a French Press at 15,000 PSI. After lysis, cell extract was treated at 70 C. for 10 to inactivate host enzymes. The extract was then clarified following centrifugation at 12,000 rpm for 30 minutes. The supernatant containing the enzyme fraction was then used for further purification.

[0085] The lysate was then loaded on to a Q-Sepharose HP column previously equilibrated with buffer A (Tris 50 mM (pH 7.5), EDTA 1 mM, NaCl 150 mM, 10% glycerol). The column was washed four times with buffer A. The flow rate of the buffer was 8- 10 ml per minute. This step selectively binds nucleic acid and the follow-through containing the enzyme is used in subsequent column. The flow-through sample was concentrated to small volume and removed of salt by tangential flow and diafiltration device to prepare for the next column. The sample was loaded on to a second Q-Sepharose HP column pre-equilibrated with buffer B (Tris 50 mM (pH 7.5), EDTA 1 mM, 10% glycerol). The column was washed with buffer B for three additional column volumes to remove any unbound proteins. The ΔTts F412YD426R pol I preparation was eluted by establishing a 0-30% gradient salt using NaCl. The eluted sample was dialyzed against buffer C (30 mM sodium phosphate, 30 mM sodium formate, 60 mM sodium acetate, 1 mM EDTA and 10% glycerol). The dialyzed sample was loaded on to a Resource S column previously equilibrated with buffer C. Column was washed with buffer C for three additional column volumes to remove unbound proteins. ΔTtsF412YD426R Pol I was eluted specifically using a 0-50% salt gradient using NaCl. This sample contained the purified enzyme preparation.

[0086] A similar purification protocol was employed for ΔTtsD426R Pol I.

1 32 1 872 PRT Thermoanaerobacter thermohydrosulfuricus 1 Met Tyr Lys Phe Leu Ile Ile Asp Gly Ser Ser Leu Met Tyr Arg Ala 1 5 10 15 Tyr Tyr Ala Leu Pro Met Leu Thr Thr Ser Glu Gly Leu Pro Thr Asn 20 25 30 Ala Leu Tyr Gly Phe Thr Met Met Leu Ile Lys Leu Ile Glu Glu Glu 35 40 45 Lys Pro Asp Tyr Ile Ala Ile Ala Phe Asp Lys Lys Ala Pro Thr Phe 50 55 60 Arg His Lys Glu Tyr Gln Asp Tyr Lys Ala Thr Arg Gln Ala Met Pro 65 70 75 80 Glu Glu Leu Ala Glu Gln Val Asp Tyr Leu Lys Glu Ile Ile Asp Gly 85 90 95 Phe Asn Ile Lys Thr Leu Glu Leu Glu Gly Tyr Glu Ala Asp Asp Ile 100 105 110 Ile Gly Thr Ile Ser Lys Leu Ala Glu Glu Lys Gly Met Glu Val Leu 115 120 125 Val Val Thr Gly Asp Arg Asp Ala Leu Gln Leu Val Ser Asp Lys Val 130 135 140 Lys Ile Lys Ile Ser Lys Lys Gly Ile Thr Gln Met Glu Glu Phe Asp 145 150 155 160 Glu Lys Ala Ile Leu Glu Arg Tyr Gly Ile Thr Pro Gln Gln Phe Ile 165 170 175 Asp Leu Lys Gly Leu Met Gly Asp Lys Ser Asp Asn Ile Pro Gly Val 180 185 190 Pro Asn Ile Gly Glu Lys Thr Ala Ile Lys Leu Leu Lys Asp Phe Gly 195 200 205 Thr Ile Glu Asn Leu Ile Gln Asn Leu Ser Gln Leu Lys Gly Lys Ile 210 215 220 Lys Glu Asn Ile Glu Asn Asn Lys Glu Leu Ala Ile Met Ser Lys Arg 225 230 235 240 Leu Ala Thr Ile Lys Arg Asp Ile Pro Ile Glu Ile Asp Phe Glu Glu 245 250 255 Tyr Lys Val Lys Lys Phe Asn Glu Glu Lys Leu Leu Glu Leu Phe Asn 260 265 270 Lys Leu Glu Phe Phe Ser Leu Ile Asp Asn Ile Lys Lys Glu Ser Ser 275 280 285 Ile Glu Ile Val Asp Asn His Lys Val Glu Lys Trp Ser Lys Val Asp 290 295 300 Ile Lys Glu Leu Val Thr Leu Leu Gln Asp Asn Arg Asn Ile Ala Phe 305 310 315 320 Tyr Pro Leu Ile Tyr Glu Gly Glu Ile Lys Lys Ile Ala Phe Ser Phe 325 330 335 Gly Lys Asp Thr Val Tyr Ile Asp Val Phe Gln Thr Glu Asp Leu Lys 340 345 350 Glu Ile Phe Glu Lys Glu Asp Phe Glu Phe Thr Thr His Glu Ile Lys 355 360 365 Asp Phe Leu Val Arg Leu Ser Tyr Lys Gly Ile Glu Cys Lys Ser Lys 370 375 380 Tyr Ile Asp Thr Ala Val Met Ala Tyr Leu Leu Asn Pro Ser Glu Ser 385 390 395 400 Asn Tyr Asp Leu Asp Arg Val Leu Lys Lys Tyr Leu Lys Val Asp Val 405 410 415 Pro Ser Tyr Glu Gly Ile Phe Gly Lys Gly Arg Asp Lys Lys Lys Ile 420 425 430 Glu Glu Ile Asp Glu Asn Ile Leu Ala Asp Tyr Ile Cys Ser Arg Cys 435 440 445 Val Tyr Leu Phe Asp Leu Lys Glu Lys Leu Met Asn Phe Ile Glu Glu 450 455 460 Met Asp Met Lys Lys Leu Leu Leu Glu Ile Glu Met Pro Leu Val Glu 465 470 475 480 Val Leu Lys Ser Met Glu Val Ser Gly Phe Thr Leu Asp Lys Glu Val 485 490 495 Leu Lys Glu Leu Ser Gln Lys Ile Asp Asp Arg Ile Gly Glu Ile Leu 500 505 510 Asp Lys Ile Tyr Lys Glu Ala Gly Tyr Gln Phe Asn Val Asn Ser Pro 515 520 525 Lys Gln Leu Ser Glu Phe Leu Phe Glu Lys Leu Asn Leu Pro Val Ile 530 535 540 Lys Lys Thr Lys Thr Gly Tyr Ser Thr Asp Ser Glu Val Leu Glu Gln 545 550 555 560 Leu Val Pro Tyr Asn Asp Ile Val Ser Asp Ile Ile Glu Tyr Arg Gln 565 570 575 Leu Thr Lys Leu Lys Ser Thr Tyr Ile Asp Gly Phe Leu Pro Leu Met 580 585 590 Asp Glu Asn Asn Arg Val His Ser Asn Phe Lys Gln Met Val Thr Ala 595 600 605 Thr Gly Arg Ile Ser Ser Thr Glu Pro Asn Leu Gln Asn Ile Pro Ile 610 615 620 Arg Glu Glu Phe Gly Arg Gln Ile Arg Arg Ala Phe Ile Pro Arg Ser 625 630 635 640 Arg Asp Gly Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg 645 650 655 Val Leu Ala His Val Ser Gly Asp Glu Lys Leu Ile Glu Ser Phe Met 660 665 670 Asn Asn Glu Asp Ile His Leu Arg Thr Ala Ser Glu Val Phe Lys Val 675 680 685 Pro Met Glu Lys Val Thr Pro Glu Met Arg Arg Ala Ala Lys Ala Val 690 695 700 Asn Phe Gly Ile Ile Tyr Gly Ile Ser Asp Tyr Gly Leu Ser Arg Asp 705 710 715 720 Leu Lys Ile Ser Arg Lys Glu Ala Lys Glu Tyr Ile Asn Asn Tyr Phe 725 730 735 Glu Arg Tyr Lys Gly Val Lys Asp Tyr Ile Glu Lys Ile Val Arg Phe 740 745 750 Ala Lys Glu Asn Gly Tyr Val Thr Thr Ile Met Asn Arg Arg Arg Tyr 755 760 765 Ile Pro Glu Ile Asn Ser Arg Asn Phe Thr Gln Arg Ser Gln Ala Glu 770 775 780 Arg Leu Ala Met Asn Ala Pro Ile Gln Gly Ser Ala Ala Asp Ile Ile 785 790 795 800 Lys Met Ala Met Val Lys Val Tyr Asn Asp Leu Lys Lys Leu Lys Leu 805 810 815 Lys Ser Lys Leu Ile Leu Gln Val His Asp Glu Leu Val Val Asp Thr 820 825 830 Tyr Lys Asp Glu Val Asp Ile Ile Lys Lys Ile Leu Lys Glu Asn Met 835 840 845 Glu Asn Val Val Gln Leu Lys Val Pro Leu Val Val Glu Ile Gly Val 850 855 860 Gly Pro Asn Trp Phe Leu Ala Lys 865 870 2 2619 DNA Thermoanaerobacter thermohydrosulfuricus 2 atgtataaat ttttaataat tgatggaagt agcctcatgt acagagccta ttatgccttg 60 cccatgctta ctacaagtga gggattgcct acaaatgctc tgtatggttt tactatgatg 120 cttataaaac ttatcgagga ggaaaaacct gattacatag ctattgcttt tgacaaaaaa 180 gctcctactt ttagacacaa agaatatcaa gactacaaag ctacaagaca agctatgcct 240 gaagaacttg ctgaacaagt agactatttg aaagaaatta tagatggctt taatataaag 300 acattagaat tagaaggtta tgaagctgat gacattatag ggactatttc aaagctggca 360 gaggaaaaag gaatggaagt gcttgtagtt acaggagaca gagatgctct tcaattagtt 420 tcagataaag tgaagataaa aatttctaaa aagggtatta ctcagatgga agagtttgac 480 gaaaaggcta ttttagaaag gtatggaata actcctcagc agtttataga tttaaaaggg 540 cttatgggag ataaatctga taatatccct ggagtaccta atatagggga aaaaactgcg 600 attaagctat taaaggattt tggaacaatt gaaaatttaa tccaaaatct ttctcagctt 660 aaaggtaaaa taaaagaaaa tatagaaaac aataaagagt tagctataat gagtaagagg 720 cttgctacta taaaaagaga cattcccatt gagatagatt ttgaggagta taaagtaaaa 780 aaatttaatg aggagaagct tttagagctt tttaataaat tagaattctt tagtttaatt 840 gataacataa agaaagaaag tagcatagag attgtagata atcataaagt tgaaaaatgg 900 tcaaaagtag atataaaaga attagtaact ttgttgcaag ataacagaaa tattgctttt 960 tacccgttaa tttatgaagg ggaaataaaa aaaatagcct tttcttttgg aaaggatacg 1020 gtttatattg acgttttcca aacagaagat ttaaaggaga tttttgaaaa agaagatttt 1080 gaatttacaa cccatgaaat aaaggatttt ttagtgaggc tttcttataa aggaatagag 1140 tgtaaaagca agtacataga tactgctgta atggcttatc ttctgaatcc ttctgagtct 1200 aactatgact tagaccgtgt gctaaaaaaa tatttaaagg tagatgtgcc ttcttatgaa 1260 ggaatatttg gcaaaggtag ggataaaaag aaaattgaag agattgacga aaacatactt 1320 gctgattata tttgcagtag atgtgtgtat ctatttgatt taaaagaaaa gctgatgaat 1380 tttattgaag agatggatat gaaaaaactt ctattagaaa tagaaatgcc tcttgtagaa 1440 gttttaaaat caatggaggt aagtggtttt acattggata aagaagttct aaaagagctt 1500 tcacaaaaga tagatgatag aataggagaa atactagata aaatttataa agaggcagga 1560 tatcaattta atgtaaattc acctaagcaa ttaagtgaat ttttgtttga aaagttaaac 1620 ttaccagtaa taaagaaaac aaaaacagga tactctacgg attctgaagt tttggaacaa 1680 ttggttcctt ataatgatat tgtcagcgat ataatagagt atcggcaact tacaaaactt 1740 aaatctactt atatagatgg atttttgcct cttatggatg aaaacaatag agtacattct 1800 aattttaaac aaatggttac tgctacaggt agaataagca gcaccgagcc aaatctacaa 1860 aatataccta taagagaaga gtttggcaga caaattagaa gggcttttat tccgaggagt 1920 agagatggat atattgtttc agcagattat tctcagattg aactgagggt tttagcacat 1980 gtttcgggag atgaaaagct aatagaatct tttatgaata atgaagatat acatttaagg 2040 acagcttcgg aggtttttaa agttcctatg gaaaaagtta caccggagat gagaagagca 2100 gcaaaagccg taaattttgg cataatatat ggcataagcg attatgggct ttctcgagac 2160 cttaaaatat caagaaaaga agcaaaagag tacataaata attattttga aagatataaa 2220 ggagtaaaag attatattga aaaaatagta cgatttgcaa aagaaaatgg ctatgtgact 2280 acaataatga acagaaggag atatattcct gaaataaact caagaaattt tactcaaaga 2340 tcgcaggccg aaaggttagc aatgaatgct ccgatacagg gaagtgcggc tgatataata 2400 aaaatggcaa tggttaaggt atacaacgat ttaaaaaaat taaagcttaa gtctaagctt 2460 atattgcaag ttcatgacga gcttgtagtg gatacttata aggatgaagt agatatcata 2520 aaaaagatac ttaaagaaaa tatggaaaat gtagtgcaat taaaagttcc tctggttgtt 2580 gaaattggcg tagggcctaa ttggtttttg gccaagtga 2619 3 872 PRT Thermoanaerobacter thermohydrosulfuricus 3 Met Tyr Lys Phe Leu Ile Ile Asp Gly Ser Ser Leu Met Tyr Arg Ala 1 5 10 15 Tyr Tyr Ala Leu Pro Met Leu Thr Thr Ser Glu Gly Leu Pro Thr Asn 20 25 30 Ala Leu Tyr Gly Phe Thr Met Met Leu Ile Lys Leu Ile Glu Glu Glu 35 40 45 Lys Pro Asp Tyr Ile Ala Ile Ala Phe Asp Lys Lys Ala Pro Thr Phe 50 55 60 Arg His Lys Glu Tyr Gln Asp Tyr Lys Ala Thr Arg Gln Ala Met Pro 65 70 75 80 Glu Glu Leu Ala Glu Gln Val Asp Tyr Leu Lys Glu Ile Ile Asp Gly 85 90 95 Phe Asn Ile Lys Thr Leu Glu Leu Glu Gly Tyr Glu Ala Asp Asp Ile 100 105 110 Ile Gly Thr Ile Ser Lys Leu Ala Glu Glu Lys Gly Met Glu Val Leu 115 120 125 Val Val Thr Gly Asp Arg Asp Ala Leu Gln Leu Val Ser Asp Lys Val 130 135 140 Lys Ile Lys Ile Ser Lys Lys Gly Ile Thr Gln Met Glu Glu Phe Asp 145 150 155 160 Glu Lys Ala Ile Leu Glu Arg Tyr Gly Ile Thr Pro Gln Gln Phe Ile 165 170 175 Asp Leu Lys Gly Leu Met Gly Asp Lys Ser Asp Asn Ile Pro Gly Val 180 185 190 Pro Asn Ile Gly Glu Lys Thr Ala Ile Lys Leu Leu Lys Asp Phe Gly 195 200 205 Thr Ile Glu Asn Leu Ile Gln Asn Leu Ser Gln Leu Lys Gly Lys Ile 210 215 220 Lys Glu Asn Ile Glu Asn Asn Lys Glu Leu Ala Ile Met Ser Lys Arg 225 230 235 240 Leu Ala Thr Ile Lys Arg Asp Ile Pro Ile Glu Ile Asp Phe Glu Glu 245 250 255 Tyr Lys Val Lys Lys Phe Asn Glu Glu Lys Leu Leu Glu Leu Phe Asn 260 265 270 Lys Leu Glu Phe Phe Ser Leu Ile Asp Asn Ile Lys Lys Glu Ser Ser 275 280 285 Ile Glu Ile Val Asp Asn His Lys Val Glu Lys Trp Ser Lys Val Asp 290 295 300 Ile Lys Glu Leu Val Thr Leu Leu Gln Asp Asn Arg Asn Ile Ala Phe 305 310 315 320 Tyr Pro Leu Ile Tyr Glu Gly Glu Ile Lys Lys Ile Ala Phe Ser Phe 325 330 335 Gly Lys Asp Thr Val Tyr Ile Asp Val Phe Gln Thr Glu Asp Leu Lys 340 345 350 Glu Ile Phe Glu Lys Glu Asp Phe Glu Phe Thr Thr His Glu Ile Lys 355 360 365 Asp Phe Leu Val Arg Leu Ser Tyr Lys Gly Ile Glu Cys Lys Ser Lys 370 375 380 Tyr Ile Asp Thr Ala Val Met Ala Tyr Leu Leu Asn Pro Ser Glu Ser 385 390 395 400 Asn Tyr Asp Leu Asp Arg Val Leu Lys Lys Tyr Leu Lys Val Asp Val 405 410 415 Pro Ser Tyr Glu Gly Ile Phe Gly Lys Gly Arg Asp Lys Lys Lys Ile 420 425 430 Glu Glu Ile Asp Glu Asn Ile Leu Ala Asp Tyr Ile Cys Ser Arg Cys 435 440 445 Val Tyr Leu Phe Asp Leu Lys Glu Lys Leu Met Asn Phe Ile Glu Glu 450 455 460 Met Asp Met Lys Lys Leu Leu Leu Glu Ile Glu Met Pro Leu Val Glu 465 470 475 480 Val Leu Lys Ser Met Glu Val Ser Gly Phe Thr Leu Asp Lys Glu Val 485 490 495 Leu Lys Glu Leu Ser Gln Lys Ile Asp Asp Arg Ile Gly Glu Ile Leu 500 505 510 Asp Lys Ile Tyr Lys Glu Ala Gly Tyr Gln Phe Asn Val Asn Ser Pro 515 520 525 Lys Gln Leu Ser Glu Phe Leu Phe Glu Lys Leu Asn Leu Pro Val Ile 530 535 540 Lys Lys Thr Lys Thr Gly Tyr Ser Thr Asp Ser Glu Val Leu Glu Gln 545 550 555 560 Leu Val Pro Tyr Asn Asp Ile Val Ser Asp Ile Ile Glu Tyr Arg Gln 565 570 575 Leu Thr Lys Leu Lys Ser Thr Tyr Ile Asp Gly Phe Leu Pro Leu Met 580 585 590 Asp Glu Asn Asn Arg Val His Ser Asn Phe Lys Gln Met Val Thr Ala 595 600 605 Thr Gly Arg Ile Ser Ser Thr Glu Pro Asn Leu Gln Asn Ile Pro Ile 610 615 620 Arg Glu Glu Phe Gly Arg Gln Ile Arg Arg Ala Phe Ile Pro Arg Ser 625 630 635 640 Arg Asp Gly Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg 645 650 655 Val Leu Ala His Val Ser Gly Asp Glu Lys Leu Ile Glu Ser Phe Met 660 665 670 Asn Asn Glu Asp Ile His Leu Arg Thr Ala Ser Glu Val Phe Lys Val 675 680 685 Pro Met Glu Lys Val Thr Pro Glu Met Arg Arg Ala Ala Lys Ala Val 690 695 700 Asn Phe Gly Ile Ile Tyr Gly Ile Ser Asp Tyr Gly Leu Ser Arg Asp 705 710 715 720 Leu Lys Ile Ser Arg Lys Glu Ala Lys Glu Tyr Ile Asn Asn Tyr Phe 725 730 735 Glu Arg Tyr Lys Gly Val Lys Asp Tyr Ile Glu Lys Ile Val Arg Phe 740 745 750 Ala Lys Glu Asn Gly Tyr Val Thr Thr Ile Met Asn Arg Arg Arg Tyr 755 760 765 Ile Pro Glu Ile Asn Ser Arg Asn Phe Thr Gln Arg Ser Gln Ala Glu 770 775 780 Arg Leu Ala Met Asn Ala Pro Ile Gln Gly Ser Ala Ala Asp Ile Ile 785 790 795 800 Lys Met Ala Met Val Lys Val Tyr Asn Asp Leu Lys Lys Leu Lys Leu 805 810 815 Lys Ser Lys Leu Ile Leu Gln Val His Asp Glu Leu Val Val Asp Thr 820 825 830 Tyr Lys Asp Glu Val Asp Ile Ile Lys Lys Ile Leu Lys Glu Asn Met 835 840 845 Glu Asn Val Val Gln Leu Lys Val Pro Leu Val Val Glu Ile Gly Val 850 855 860 Gly Pro Asn Trp Phe Leu Ala Lys 865 870 4 1737 DNA Thermoanaerobacter thermohydrosulfuricus 4 atgaaagttg aaaaatggtc aaaagtagat ataaaagaat tagtaacttt gttgcaagat 60 aacagaaata ttgcttttta cccgttaatt tatgaagggg aaataaaaaa aatagccttt 120 tcttttggaa aggatacggt ttatattgac gttttccaaa cagaagattt aaaggagatt 180 tttgaaaaag aagattttga atttacaacc catgaaataa aggatttttt agtgaggctt 240 tcttataaag gaatagagtg taaaagcaag tacatagata ctgctgtaat ggcttatctt 300 ctgaatcctt ctgagtctaa ctatgactta gaccgtgtgc taaaaaaata tttaaaggta 360 gatgtgcctt cttatgaagg aatatttggc aaaggtaggg ataaaaagaa aattgaagag 420 attgacgaaa acatacttgc tgattatatt tgcagtagat gtgtgtatct atttgattta 480 aaagaaaagc tgatgaattt tattgaagag atggatatga aaaaacttct attagaaata 540 gaaatgcctc ttgtagaagt tttaaaatca atggaggtaa gtggttttac attggataaa 600 gaagttctaa aagagctttc acaaaagata gatgatagaa taggagaaat actagataaa 660 atttataaag aggcaggata tcaatttaat gtaaattcac ctaagcaatt aagtgaattt 720 ttgtttgaaa agttaaactt accagtaata aagaaaacaa aaacaggata ctctacggat 780 tctgaagttt tggaacaatt ggttccttat aatgatattg tcagcgatat aatagagtat 840 cggcaactta caaaacttaa atctacttat atagatggat ttttgcctct tatggatgaa 900 aacaatagag tacattctaa ttttaaacaa atggttactg ctacaggtag aataagcagc 960 accgagccaa atctacaaaa tatacctata agagaagagt ttggcagaca aattagaagg 1020 gcttttattc cgaggagtag agatggatat attgtttcag cagattattc tcagattgaa 1080 ctgagggttt tagcacatgt ttcgggagat gaaaagctaa tagaatcttt tatgaataat 1140 gaagatatac atttaaggac agcttcggag gtttttaaag ttcctatgga aaaagttaca 1200 ccggagatga gaagagcagc aaaagccgta aattttggca taatatatgg cataagcgat 1260 tatgggcttt ctcgagacct taaaatatca agaaaagaag caaaagagta cataaataat 1320 tattttgaaa gatataaagg agtaaaagat tatattgaaa aaatagtacg atttgcaaaa 1380 gaaaatggct atgtgactac aataatgaac agaaggagat atattcctga aataaactca 1440 agaaatttta ctcaaagatc gcaggccgaa aggttagcaa tgaatgctcc gatacaggga 1500 agtgcggctg atataataaa aatggcaatg gttaaggtat acaacgattt aaaaaaatta 1560 aagcttaagt ctaagcttat attgcaagtt catgacgagc ttgtagtgga tacttataag 1620 gatgaagtag atatcataaa aaagatactt aaagaaaata tggaaaatgt agtgcaatta 1680 aaagttcctc tggttgttga aattggcgta gggcctaatt ggtttttggc caagtga 1737 5 872 PRT Thermoanaerobacter thermohydrosulfuricus 5 Met Tyr Lys Phe Leu Ile Ile Asp Gly Ser Ser Leu Met Tyr Arg Ala 1 5 10 15 Tyr Tyr Ala Leu Pro Met Leu Thr Thr Ser Glu Gly Leu Pro Thr Asn 20 25 30 Ala Leu Tyr Gly Phe Thr Met Met Leu Ile Lys Leu Ile Glu Glu Glu 35 40 45 Lys Pro Asp Tyr Ile Ala Ile Ala Phe Asp Lys Lys Ala Pro Thr Phe 50 55 60 Arg His Lys Glu Tyr Gln Asp Tyr Lys Ala Thr Arg Gln Ala Met Pro 65 70 75 80 Glu Glu Leu Ala Glu Gln Val Asp Tyr Leu Lys Glu Ile Ile Asp Gly 85 90 95 Phe Asn Ile Lys Thr Leu Glu Leu Glu Gly Tyr Glu Ala Asp Asp Ile 100 105 110 Ile Gly Thr Ile Ser Lys Leu Ala Glu Glu Lys Gly Met Glu Val Leu 115 120 125 Val Val Thr Gly Asp Arg Asp Ala Leu Gln Leu Val Ser Asp Lys Val 130 135 140 Lys Ile Lys Ile Ser Lys Lys Gly Ile Thr Gln Met Glu Glu Phe Asp 145 150 155 160 Glu Lys Ala Ile Leu Glu Arg Tyr Gly Ile Thr Pro Gln Gln Phe Ile 165 170 175 Asp Leu Lys Gly Leu Met Gly Asp Lys Ser Asp Asn Ile Pro Gly Val 180 185 190 Pro Asn Ile Gly Glu Lys Thr Ala Ile Lys Leu Leu Lys Asp Phe Gly 195 200 205 Thr Ile Glu Asn Leu Ile Gln Asn Leu Ser Gln Leu Lys Gly Lys Ile 210 215 220 Lys Glu Asn Ile Glu Asn Asn Lys Glu Leu Ala Ile Met Ser Lys Arg 225 230 235 240 Leu Ala Thr Ile Lys Arg Asp Ile Pro Ile Glu Ile Asp Phe Glu Glu 245 250 255 Tyr Lys Val Lys Lys Phe Asn Glu Glu Lys Leu Leu Glu Leu Phe Asn 260 265 270 Lys Leu Glu Phe Phe Ser Leu Ile Asp Asn Ile Lys Lys Glu Ser Ser 275 280 285 Ile Glu Ile Val Asp Asn His Lys Val Glu Lys Trp Ser Lys Val Asp 290 295 300 Ile Lys Glu Leu Val Thr Leu Leu Gln Asp Asn Arg Asn Ile Ala Phe 305 310 315 320 Tyr Pro Leu Ile Tyr Glu Gly Glu Ile Lys Lys Ile Ala Phe Ser Phe 325 330 335 Gly Lys Asp Thr Val Tyr Ile Asp Val Phe Gln Thr Glu Asp Leu Lys 340 345 350 Glu Ile Phe Glu Lys Glu Asp Phe Glu Phe Thr Thr His Glu Ile Lys 355 360 365 Asp Phe Leu Val Arg Leu Ser Tyr Lys Gly Ile Glu Cys Lys Ser Lys 370 375 380 Tyr Ile Asp Thr Ala Val Met Ala Tyr Leu Leu Asn Pro Ser Glu Ser 385 390 395 400 Asn Tyr Asp Leu Asp Arg Val Leu Lys Lys Tyr Leu Lys Val Asp Val 405 410 415 Pro Ser Tyr Glu Gly Ile Phe Gly Lys Gly Arg Asp Lys Lys Lys Ile 420 425 430 Glu Glu Ile Asp Glu Asn Ile Leu Ala Asp Tyr Ile Cys Ser Arg Cys 435 440 445 Val Tyr Leu Phe Asp Leu Lys Glu Lys Leu Met Asn Phe Ile Glu Glu 450 455 460 Met Asp Met Lys Lys Leu Leu Leu Glu Ile Glu Met Pro Leu Val Glu 465 470 475 480 Val Leu Lys Ser Met Glu Val Ser Gly Phe Thr Leu Asp Lys Glu Val 485 490 495 Leu Lys Glu Leu Ser Gln Lys Ile Asp Asp Arg Ile Gly Glu Ile Leu 500 505 510 Asp Lys Ile Tyr Lys Glu Ala Gly Tyr Gln Phe Asn Val Asn Ser Pro 515 520 525 Lys Gln Leu Ser Glu Phe Leu Phe Glu Lys Leu Asn Leu Pro Val Ile 530 535 540 Lys Lys Thr Lys Thr Gly Tyr Ser Thr Asp Ser Glu Val Leu Glu Gln 545 550 555 560 Leu Val Pro Tyr Asn Asp Ile Val Ser Asp Ile Ile Glu Tyr Arg Gln 565 570 575 Leu Thr Lys Leu Lys Ser Thr Tyr Ile Asp Gly Phe Leu Pro Leu Met 580 585 590 Asp Glu Asn Asn Arg Val His Ser Asn Phe Lys Gln Met Val Thr Ala 595 600 605 Thr Gly Arg Ile Ser Ser Thr Glu Pro Asn Leu Gln Asn Ile Pro Ile 610 615 620 Arg Glu Glu Phe Gly Arg Gln Ile Arg Arg Ala Phe Ile Pro Arg Ser 625 630 635 640 Arg Asp Gly Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg 645 650 655 Val Leu Ala His Val Ser Gly Asp Glu Lys Leu Ile Glu Ser Phe Met 660 665 670 Asn Asn Glu Asp Ile His Leu Arg Thr Ala Ser Glu Val Phe Lys Val 675 680 685 Pro Met Glu Lys Val Thr Pro Glu Met Arg Arg Ala Ala Lys Ala Val 690 695 700 Asn Tyr Gly Ile Ile Tyr Gly Ile Ser Asp Tyr Gly Leu Ser Arg Asp 705 710 715 720 Leu Lys Ile Ser Arg Lys Glu Ala Lys Glu Tyr Ile Asn Asn Tyr Phe 725 730 735 Glu Arg Tyr Lys Gly Val Lys Asp Tyr Ile Glu Lys Ile Val Arg Phe 740 745 750 Ala Lys Glu Asn Gly Tyr Val Thr Thr Ile Met Asn Arg Arg Arg Tyr 755 760 765 Ile Pro Glu Ile Asn Ser Arg Asn Phe Thr Gln Arg Ser Gln Ala Glu 770 775 780 Arg Leu Ala Met Asn Ala Pro Ile Gln Gly Ser Ala Ala Asp Ile Ile 785 790 795 800 Lys Met Ala Met Val Lys Val Tyr Asn Asp Leu Lys Lys Leu Lys Leu 805 810 815 Lys Ser Lys Leu Ile Leu Gln Val His Asp Glu Leu Val Val Asp Thr 820 825 830 Tyr Lys Asp Glu Val Asp Ile Ile Lys Lys Ile Leu Lys Glu Asn Met 835 840 845 Glu Asn Val Val Gln Leu Lys Val Pro Leu Val Val Glu Ile Gly Val 850 855 860 Gly Pro Asn Trp Phe Leu Ala Lys 865 870 6 1737 DNA Thermoanaerobacter thermohydrosulfuricus 6 atgaaagttg aaaaatggtc aaaagtagat ataaaagaat tagtaacttt gttgcaagat 60 aacagaaata ttgcttttta cccgttaatt tatgaagggg aaataaaaaa aatagccttt 120 tcttttggaa aggatacggt ttatattgac gttttccaaa cagaagattt aaaggagatt 180 tttgaaaaag aagattttga atttacaacc catgaaataa aggatttttt agtgaggctt 240 tcttataaag gaatagagtg taaaagcaag tacatagata ctgctgtaat ggcttatctt 300 ctgaatcctt ctgagtctaa ctatgactta gaccgtgtgc taaaaaaata tttaaaggta 360 gatgtgcctt cttatgaagg aatatttggc aaaggtaggg ataaaaagaa aattgaagag 420 attgacgaaa acatacttgc tgattatatt tgcagtagat gtgtgtatct atttgattta 480 aaagaaaagc tgatgaattt tattgaagag atggatatga aaaaacttct attagaaata 540 gaaatgcctc ttgtagaagt tttaaaatca atggaggtaa gtggttttac attggataaa 600 gaagttctaa aagagctttc acaaaagata gatgatagaa taggagaaat actagataaa 660 atttataaag aggcaggata tcaatttaat gtaaattcac ctaagcaatt aagtgaattt 720 ttgtttgaaa agttaaactt accagtaata aagaaaacaa aaacaggata ctctacggat 780 tctgaagttt tggaacaatt ggttccttat aatgatattg tcagcgatat aatagagtat 840 cggcaactta caaaacttaa atctacttat atagatggat ttttgcctct tatggatgaa 900 aacaatagag tacattctaa ttttaaacaa atggttactg ctacaggtag aataagcagc 960 accgagccaa atctacaaaa tatacctata agagaagagt ttggcagaca aattagaagg 1020 gcttttattc cgaggagtag agatggatat attgtttcag cagattattc tcagattgaa 1080 ctgagggttt tagcacatgt ttcgggagat gaaaagctaa tagaatcttt tatgaataat 1140 gaagatatac atttaaggac agcttcggag gtttttaaag ttcctatgga aaaagttaca 1200 ccggagatga gaagagcagc aaaagccgta aattatggca taatatatgg cataagcgat 1260 tatgggcttt ctcgagacct taaaatatca agaaaagaag caaaagagta cataaataat 1320 tattttgaaa gatataaagg agtaaaagat tatattgaaa aaatagtacg atttgcaaaa 1380 gaaaatggct atgtgactac aataatgaac agaaggagat atattcctga aataaactca 1440 agaaatttta ctcaaagatc gcaggccgaa aggttagcaa tgaatgctcc gatacaggga 1500 agtgcggctg atataataaa aatggcaatg gttaaggtat acaacgattt aaaaaaatta 1560 aagcttaagt ctaagcttat attgcaagtt catgacgagc ttgtagtgga tacttataag 1620 gatgaagtag atatcataaa aaagatactt aaagaaaata tggaaaatgt agtgcaatta 1680 aaagttcctc tggttgttga aattggcgta gggcctaatt ggtttttggc caagtga 1737 7 872 PRT Thermoanaerobacter thermohydrosulfuricus 7 Met Tyr Lys Phe Leu Ile Ile Asp Gly Ser Ser Leu Met Tyr Arg Ala 1 5 10 15 Tyr Tyr Ala Leu Pro Met Leu Thr Thr Ser Glu Gly Leu Pro Thr Asn 20 25 30 Ala Leu Tyr Gly Phe Thr Met Met Leu Ile Lys Leu Ile Glu Glu Glu 35 40 45 Lys Pro Asp Tyr Ile Ala Ile Ala Phe Asp Lys Lys Ala Pro Thr Phe 50 55 60 Arg His Lys Glu Tyr Gln Asp Tyr Lys Ala Thr Arg Gln Ala Met Pro 65 70 75 80 Glu Glu Leu Ala Glu Gln Val Asp Tyr Leu Lys Glu Ile Ile Asp Gly 85 90 95 Phe Asn Ile Lys Thr Leu Glu Leu Glu Gly Tyr Glu Ala Asp Asp Ile 100 105 110 Ile Gly Thr Ile Ser Lys Leu Ala Glu Glu Lys Gly Met Glu Val Leu 115 120 125 Val Val Thr Gly Asp Arg Asp Ala Leu Gln Leu Val Ser Asp Lys Val 130 135 140 Lys Ile Lys Ile Ser Lys Lys Gly Ile Thr Gln Met Glu Glu Phe Asp 145 150 155 160 Glu Lys Ala Ile Leu Glu Arg Tyr Gly Ile Thr Pro Gln Gln Phe Ile 165 170 175 Asp Leu Lys Gly Leu Met Gly Asp Lys Ser Asp Asn Ile Pro Gly Val 180 185 190 Pro Asn Ile Gly Glu Lys Thr Ala Ile Lys Leu Leu Lys Asp Phe Gly 195 200 205 Thr Ile Glu Asn Leu Ile Gln Asn Leu Ser Gln Leu Lys Gly Lys Ile 210 215 220 Lys Glu Asn Ile Glu Asn Asn Lys Glu Leu Ala Ile Met Ser Lys Arg 225 230 235 240 Leu Ala Thr Ile Lys Arg Asp Ile Pro Ile Glu Ile Asp Phe Glu Glu 245 250 255 Tyr Lys Val Lys Lys Phe Asn Glu Glu Lys Leu Leu Glu Leu Phe Asn 260 265 270 Lys Leu Glu Phe Phe Ser Leu Ile Asp Asn Ile Lys Lys Glu Ser Ser 275 280 285 Ile Glu Ile Val Asp Asn His Lys Val Glu Lys Trp Ser Lys Val Asp 290 295 300 Ile Lys Glu Leu Val Thr Leu Leu Gln Asp Asn Arg Asn Ile Ala Phe 305 310 315 320 Tyr Pro Leu Ile Tyr Glu Gly Glu Ile Lys Lys Ile Ala Phe Ser Phe 325 330 335 Gly Lys Asp Thr Val Tyr Ile Asp Val Phe Gln Thr Glu Asp Leu Lys 340 345 350 Glu Ile Phe Glu Lys Glu Asp Phe Glu Phe Thr Thr His Glu Ile Lys 355 360 365 Asp Phe Leu Val Arg Leu Ser Tyr Lys Gly Ile Glu Cys Lys Ser Lys 370 375 380 Tyr Ile Asp Thr Ala Val Met Ala Tyr Leu Leu Asn Pro Ser Glu Ser 385 390 395 400 Asn Tyr Asp Leu Asp Arg Val Leu Lys Lys Tyr Leu Lys Val Asp Val 405 410 415 Pro Ser Tyr Glu Gly Ile Phe Gly Lys Gly Arg Asp Lys Lys Lys Ile 420 425 430 Glu Glu Ile Asp Glu Asn Ile Leu Ala Asp Tyr Ile Cys Ser Arg Cys 435 440 445 Val Tyr Leu Phe Asp Leu Lys Glu Lys Leu Met Asn Phe Ile Glu Glu 450 455 460 Met Asp Met Lys Lys Leu Leu Leu Glu Ile Glu Met Pro Leu Val Glu 465 470 475 480 Val Leu Lys Ser Met Glu Val Ser Gly Phe Thr Leu Asp Lys Glu Val 485 490 495 Leu Lys Glu Leu Ser Gln Lys Ile Asp Asp Arg Ile Gly Glu Ile Leu 500 505 510 Asp Lys Ile Tyr Lys Glu Ala Gly Tyr Gln Phe Asn Val Asn Ser Pro 515 520 525 Lys Gln Leu Ser Glu Phe Leu Phe Glu Lys Leu Asn Leu Pro Val Ile 530 535 540 Lys Lys Thr Lys Thr Gly Tyr Ser Thr Asp Ser Glu Val Leu Glu Gln 545 550 555 560 Leu Val Pro Tyr Asn Asp Ile Val Ser Asp Ile Ile Glu Tyr Arg Gln 565 570 575 Leu Thr Lys Leu Lys Ser Thr Tyr Ile Asp Gly Phe Leu Pro Leu Met 580 585 590 Asp Glu Asn Asn Arg Val His Ser Asn Phe Lys Gln Met Val Thr Ala 595 600 605 Thr Gly Arg Ile Ser Ser Thr Glu Pro Asn Leu Gln Asn Ile Pro Ile 610 615 620 Arg Glu Glu Phe Gly Arg Gln Ile Arg Arg Ala Phe Ile Pro Arg Ser 625 630 635 640 Arg Asp Gly Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg 645 650 655 Val Leu Ala His Val Ser Gly Asp Glu Lys Leu Ile Glu Ser Phe Met 660 665 670 Asn Asn Glu Asp Ile His Leu Arg Thr Ala Ser Glu Val Phe Lys Val 675 680 685 Pro Met Glu Lys Val Thr Pro Glu Met Arg Arg Ala Ala Lys Ala Val 690 695 700 Asn Tyr Gly Ile Ile Tyr Gly Ile Ser Asp Tyr Gly Leu Ser Arg Arg 705 710 715 720 Leu Lys Ile Ser Arg Lys Glu Ala Lys Glu Tyr Ile Asn Asn Tyr Phe 725 730 735 Glu Arg Tyr Lys Gly Val Lys Asp Tyr Ile Glu Lys Ile Val Arg Phe 740 745 750 Ala Lys Glu Asn Gly Tyr Val Thr Thr Ile Met Asn Arg Arg Arg Tyr 755 760 765 Ile Pro Glu Ile Asn Ser Arg Asn Phe Thr Gln Arg Ser Gln Ala Glu 770 775 780 Arg Leu Ala Met Asn Ala Pro Ile Gln Gly Ser Ala Ala Asp Ile Ile 785 790 795 800 Lys Met Ala Met Val Lys Val Tyr Asn Asp Leu Lys Lys Leu Lys Leu 805 810 815 Lys Ser Lys Leu Ile Leu Gln Val His Asp Glu Leu Val Val Asp Thr 820 825 830 Tyr Lys Asp Glu Val Asp Ile Ile Lys Lys Ile Leu Lys Glu Asn Met 835 840 845 Glu Asn Val Val Gln Leu Lys Val Pro Leu Val Val Glu Ile Gly Val 850 855 860 Gly Pro Asn Trp Phe Leu Ala Lys 865 870 8 872 PRT Thermoanaerobacter thermohydrosulfuricus 8 Met Tyr Lys Phe Leu Ile Ile Asp Gly Ser Ser Leu Met Tyr Arg Ala 1 5 10 15 Tyr Tyr Ala Leu Pro Met Leu Thr Thr Ser Glu Gly Leu Pro Thr Asn 20 25 30 Ala Leu Tyr Gly Phe Thr Met Met Leu Ile Lys Leu Ile Glu Glu Glu 35 40 45 Lys Pro Asp Tyr Ile Ala Ile Ala Phe Asp Lys Lys Ala Pro Thr Phe 50 55 60 Arg His Lys Glu Tyr Gln Asp Tyr Lys Ala Thr Arg Gln Ala Met Pro 65 70 75 80 Glu Glu Leu Ala Glu Gln Val Asp Tyr Leu Lys Glu Ile Ile Asp Gly 85 90 95 Phe Asn Ile Lys Thr Leu Glu Leu Glu Gly Tyr Glu Ala Asp Asp Ile 100 105 110 Ile Gly Thr Ile Ser Lys Leu Ala Glu Glu Lys Gly Met Glu Val Leu 115 120 125 Val Val Thr Gly Asp Arg Asp Ala Leu Gln Leu Val Ser Asp Lys Val 130 135 140 Lys Ile Lys Ile Ser Lys Lys Gly Ile Thr Gln Met Glu Glu Phe Asp 145 150 155 160 Glu Lys Ala Ile Leu Glu Arg Tyr Gly Ile Thr Pro Gln Gln Phe Ile 165 170 175 Asp Leu Lys Gly Leu Met Gly Asp Lys Ser Asp Asn Ile Pro Gly Val 180 185 190 Pro Asn Ile Gly Glu Lys Thr Ala Ile Lys Leu Leu Lys Asp Phe Gly 195 200 205 Thr Ile Glu Asn Leu Ile Gln Asn Leu Ser Gln Leu Lys Gly Lys Ile 210 215 220 Lys Glu Asn Ile Glu Asn Asn Lys Glu Leu Ala Ile Met Ser Lys Arg 225 230 235 240 Leu Ala Thr Ile Lys Arg Asp Ile Pro Ile Glu Ile Asp Phe Glu Glu 245 250 255 Tyr Lys Val Lys Lys Phe Asn Glu Glu Lys Leu Leu Glu Leu Phe Asn 260 265 270 Lys Leu Glu Phe Phe Ser Leu Ile Asp Asn Ile Lys Lys Glu Ser Ser 275 280 285 Ile Glu Ile Val Asp Asn His Lys Val Glu Lys Trp Ser Lys Val Asp 290 295 300 Ile Lys Glu Leu Val Thr Leu Leu Gln Asp Asn Arg Asn Ile Ala Phe 305 310 315 320 Tyr Pro Leu Ile Tyr Glu Gly Glu Ile Lys Lys Ile Ala Phe Ser Phe 325 330 335 Gly Lys Asp Thr Val Tyr Ile Asp Val Phe Gln Thr Glu Asp Leu Lys 340 345 350 Glu Ile Phe Glu Lys Glu Asp Phe Glu Phe Thr Thr His Glu Ile Lys 355 360 365 Asp Phe Leu Val Arg Leu Ser Tyr Lys Gly Ile Glu Cys Lys Ser Lys 370 375 380 Tyr Ile Asp Thr Ala Val Met Ala Tyr Leu Leu Asn Pro Ser Glu Ser 385 390 395 400 Asn Tyr Asp Leu Asp Arg Val Leu Lys Lys Tyr Leu Lys Val Asp Val 405 410 415 Pro Ser Tyr Glu Gly Ile Phe Gly Lys Gly Arg Asp Lys Lys Lys Ile 420 425 430 Glu Glu Ile Asp Glu Asn Ile Leu Ala Asp Tyr Ile Cys Ser Arg Cys 435 440 445 Val Tyr Leu Phe Asp Leu Lys Glu Lys Leu Met Asn Phe Ile Glu Glu 450 455 460 Met Asp Met Lys Lys Leu Leu Leu Glu Ile Glu Met Pro Leu Val Glu 465 470 475 480 Val Leu Lys Ser Met Glu Val Ser Gly Phe Thr Leu Asp Lys Glu Val 485 490 495 Leu Lys Glu Leu Ser Gln Lys Ile Asp Asp Arg Ile Gly Glu Ile Leu 500 505 510 Asp Lys Ile Tyr Lys Glu Ala Gly Tyr Gln Phe Asn Val Asn Ser Pro 515 520 525 Lys Gln Leu Ser Glu Phe Leu Phe Glu Lys Leu Asn Leu Pro Val Ile 530 535 540 Lys Lys Thr Lys Thr Gly Tyr Ser Thr Asp Ser Glu Val Leu Glu Gln 545 550 555 560 Leu Val Pro Tyr Asn Asp Ile Val Ser Asp Ile Ile Glu Tyr Arg Gln 565 570 575 Leu Thr Lys Leu Lys Ser Thr Tyr Ile Asp Gly Phe Leu Pro Leu Met 580 585 590 Asp Glu Asn Asn Arg Val His Ser Asn Phe Lys Gln Met Val Thr Ala 595 600 605 Thr Gly Arg Ile Ser Ser Thr Glu Pro Asn Leu Gln Asn Ile Pro Ile 610 615 620 Arg Glu Glu Phe Gly Arg Gln Ile Arg Arg Ala Phe Ile Pro Arg Ser 625 630 635 640 Arg Asp Gly Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg 645 650 655 Val Leu Ala His Val Ser Gly Asp Glu Lys Leu Ile Glu Ser Phe Met 660 665 670 Asn Asn Glu Asp Ile His Leu Arg Thr Ala Ser Glu Val Phe Lys Val 675 680 685 Pro Met Glu Lys Val Thr Pro Glu Met Arg Arg Ala Ala Lys Ala Val 690 695 700 Asn Phe Gly Ile Ile Tyr Gly Ile Ser Asp Tyr Gly Leu Ser Arg Asp 705 710 715 720 Leu Lys Ile Ser Arg Lys Glu Ala Lys Glu Tyr Ile Asn Asn Tyr Phe 725 730 735 Glu Arg Tyr Lys Gly Val Lys Asp Tyr Ile Glu Lys Ile Val Arg Phe 740 745 750 Ala Lys Glu Asn Gly Tyr Val Thr Thr Ile Met Asn Arg Arg Arg Tyr 755 760 765 Ile Pro Glu Ile Asn Ser Arg Asn Phe Thr Gln Arg Ser Gln Ala Glu 770 775 780 Arg Leu Ala Met Asn Ala Pro Ile Gln Gly Ser Ala Ala Asp Ile Ile 785 790 795 800 Lys Met Ala Met Val Lys Val Tyr Asn Asp Leu Lys Lys Leu Lys Leu 805 810 815 Lys Ser Lys Leu Ile Leu Gln Val His Asp Glu Leu Val Val Asp Thr 820 825 830 Tyr Lys Asp Glu Val Asp Ile Ile Lys Lys Ile Leu Lys Glu Asn Met 835 840 845 Glu Asn Val Val Gln Leu Lys Val Pro Leu Val Val Glu Ile Gly Val 850 855 860 Gly Pro Asn Trp Phe Leu Ala Lys 865 870 9 27 DNA Artificial Sequence Description of Artificial Sequence Primer 9 gggctttctc gacgccttaa aatatca 27 10 27 DNA Artificial Sequence Description of Artificial Sequence Primer 10 gccgtaaatt ttggcataat atatggc 27 11 50 DNA Thermoanaerobacter thermohydrosulfuricus 11 caggctgcct atcagaaggt ggtggctggt gtggccaatg ccctggctca 50 12 27 DNA Artificial Sequence Description of Artificial Sequence Primer 12 ggcattggcc acaccagcca ccacctt 27 13 50 DNA Thermoanaerobacter thermohydrosulfuricus 13 caggctgcct atcagaaggt ggtggctggt gtggccaatg ccctggctca 50 14 29 DNA Artificial Sequence Description of Artificial Sequence Primer 14 ggcattggcc acaccagcca ccaccttct 29 15 50 DNA Thermoanaerobacter thermohydrosulfuricus 15 caggctgcct atcagaaggt ggtggctggt gtggccaatg ccctggctca 50 16 30 DNA Artificial Sequence Description of Artificial Sequence Primer 16 ggcattggcc acaccagcca ccaccttctg 30 17 50 DNA Thermoanaerobacter thermohydrosulfuricus 17 caggctgcct atcagaaggt ggtggctggt gtggccaatg ccctggctca 50 18 28 DNA Artificial Sequence Description of Artificial Sequence Primer 18 ggcattggcc acaccagcca ccaccttc 28 19 50 DNA Thermoanaerobacter thermohydrosulfuricus 19 caggctgcct atcagaaggt ggtggctggt gtggccaatg ccctggctca 50 20 27 DNA Artificial Sequence Description of Artificial Sequence Primer 20 ggcattggcc acaccagcca ccacctt 27 21 50 DNA Thermoanaerobacter thermohydrosulfuricus 21 caggctgcct atcagaaggt ggtggctggt gtggccaatg ccctggctca 50 22 27 DNA Artificial Sequence Description of Artificial Sequence Primer 22 ggcattggcc acaccagcca ccacctt 27 23 40 PRT Thermotoga maritima 23 Asn Val Lys Pro Glu Glu Val Thr Glu Glu Met Arg Arg Ala Gly Lys 1 5 10 15 Met Val Asn Phe Ser Ile Ile Tyr Gly Val Thr Pro Tyr Gly Leu Ser 20 25 30 Val Arg Leu Gly Val Pro Val Lys 35 40 24 40 PRT Thermotoga neapolitana 24 Asn Val Lys Pro Glu Glu Val Asn Glu Glu Met Arg Arg Val Gly Lys 1 5 10 15 Met Val Asn Phe Ser Ile Ile Tyr Gly Val Thr Pro Tyr Gly Leu Ser 20 25 30 Val Arg Leu Gly Ile Pro Val Lys 35 40 25 40 PRT Thermotoga subterranea 25 Gly Val Ser Glu Met Phe Val Ser Glu Gln Met Arg Arg Val Gly Lys 1 5 10 15 Met Val Asn Phe Ala Ile Ile Tyr Gly Val Ser Pro Tyr Gly Leu Ser 20 25 30 Lys Arg Ile Gly Leu Ser Val Ser 35 40 26 40 PRT Escherichia coli 26 Gly Leu Pro Leu Glu Thr Val Thr Ser Glu Gln Arg Arg Ser Ala Lys 1 5 10 15 Ala Ile Asn Phe Gly Leu Ile Tyr Gly Met Ser Ala Phe Gly Leu Ala 20 25 30 Arg Gln Leu Asn Ile Pro Arg Lys 35 40 27 40 PRT Thermus caldophilus 27 Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 1 5 10 15 Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 20 25 30 Gln Glu Leu Ala Ile Pro Tyr Glu 35 40 28 40 PRT Thermus filiformis 28 Gly Leu Asp Pro Ala Leu Val Asp Pro Lys Met Arg Arg Ala Ala Lys 1 5 10 15 Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 20 25 30 Gln Glu Leu Gly Ile Asp Tyr Lys 35 40 29 40 PRT Thermus flavus 29 Gly Val Ser Pro Glu Gly Val Asp Pro Leu Met Arg Arg Ala Ala Lys 1 5 10 15 Thr Ile Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 20 25 30 Gly Glu Leu Ser Ile Pro Tyr Glu 35 40 30 40 PRT Thermus thermophilus 30 Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 1 5 10 15 Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 20 25 30 Gln Glu Leu Ala Ile Pro Tyr Glu 35 40 31 40 PRT Thermus aquaticus 31 Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 1 5 10 15 Thr Ile Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 20 25 30 Gln Glu Leu Ala Ile Pro Tyr Glu 35 40 32 40 PRT Thermoanaerobacter thermohydrosulfuricus 32 Lys Val Pro Met Glu Lys Val Thr Pro Glu Met Arg Arg Ala Ala Lys 1 5 10 15 Ala Val Asn Phe Gly Ile Ile Tyr Gly Ile Ser Asp Tyr Gly Leu Ser 20 25 30 Arg Asp Leu Lys Ile Ser Arg Lys 35 40 

What is claimed is:
 1. An enzymatically active DNA polymerase or active fragment thereof, having at least 80% identity in its amino acid sequence to the DNA polymerase of Thermoanaerobacter thermohydrosulfuricus or a fragment thereof, and having an amino acid alteration at position 720 in Tts Pol I or at position 426 in ΔTts Pol I or at a homologous position defined with respect to Tts DNA ploymerase I, having improved incorporation of nucleotide analogs and natural bases during DNA synthesis compared to unaltered enzyme.
 2. The polymerase of claim 1 wherein said nucleotide analogs are dNTP, ddNTP and rNTP analogs.
 3. The polymerase of claim 2 wherein said dNTP, ddNTP and rNTP analogs are dye-conjugated or biotin conjugated dNTP, ddNTP and rNTP.
 4. The polymerase of claim 3 wherein the dye in said dye-conjugated dNTP, ddNTP and rNTP is a rhodamine or Cyanine derivative dye.
 5. The polymerase of claim 4 wherein said rhodamine dye is R110, R6G, TMR or Rox.
 6. The polymerase of claim 4 wherein said Cyanine derivative dye is Cy3, Cy3.5, Cy5.0 or Cy5.5.
 7. The polymerase of claim 1, wherein said polymerase has the aspartate at position 720 in Tts Pol I or at position 426 in ΔTts Pol I, replaced with arginine.
 8. Method of performing direct RNA sequencing utilizing the polymerase of claim
 1. 9. Method of incorporating Cy3 and Cy5 dye conjugated dNTPs across a range of reaction temperatures from 37-65 C. utilizing the polymerase of claim
 1. 10. Kit for labeling a polynucleotide from a DNA or RNA template with a DNA or RNA primer comprising a DNA polymerase of claim
 1. 